We revisit the classic Pandora's Box (PB) problem under correlated distributions on the values inside the boxes. Recent work (arXiv:1911.01632) obtained constant-factor approximation algorithms for a restricted class of policies that visit the boxes in a fixed order. In this work we study the complexity of approximating the optimal policy, which may adaptively choose which box to visit next based on the values seen so far. Our main result establishes an approximation-preserving equivalence of PB to the well-studied Uniform Decision Tree (UDT) problem from stochastic optimization and to a variant of the Min-Sum Set Cover problem ($\mathcal{MSSC}_f$). For distributions of support $m$, UDT admits a $\log m$ approximation, and while a constant-factor approximation in polynomial time is a long-standing open problem, a constant-factor approximation is achievable in subexponential time (arXiv:1906.11385). Our main result implies that PB and $\mathcal{MSSC}_f$ enjoy the same properties. We also study the case where the value distribution is given more succinctly as a mixture of $m$ product distributions. This problem is again related to a noisy variant of the Optimal Decision Tree problem, which is significantly more challenging. We give a constant-factor approximation running in time $n^{\tilde{O}(m^2/\varepsilon^2)}$ when the mixture components on every box are either identical or separated by at least $\varepsilon$ in TV distance.
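As a point of reference, here is a minimal, self-contained sketch of the adaptive (minimization) PB setting over an explicit support of scenarios, with a naive greedy posterior-based policy; the instance, names, and stopping rule are illustrative only and do not correspond to the approximation algorithms discussed above.

```python
import random

# Minimal adaptive Pandora's Box (minimization variant) over an explicit
# support of m scenarios. Illustrative instance and naive greedy policy,
# not the approximation algorithms from the abstract above.

scenarios = [            # (probability, values hidden in boxes 0..2)
    (0.5, (1.0, 9.0, 4.0)),
    (0.3, (7.0, 2.0, 4.0)),
    (0.2, (7.0, 9.0, 0.5)),
]
costs = (1.0, 1.0, 1.0)  # cost of opening each box

def greedy_adaptive_policy():
    """Open boxes adaptively; stop when the cheapest expected option
    (posterior mean value + opening cost) no longer beats the best
    value already found. Objective: best value taken + costs paid."""
    true = random.choices([v for _, v in scenarios],
                          weights=[p for p, _ in scenarios])[0]
    posterior = list(scenarios)   # scenarios consistent with observations
    opened, total_cost, best = set(), 0.0, float("inf")
    while len(opened) < len(costs):
        z = sum(p for p, _ in posterior)
        exp_val = {i: sum(p * v[i] for p, v in posterior) / z
                   for i in range(len(costs)) if i not in opened}
        i = min(exp_val, key=exp_val.get)
        if exp_val[i] + costs[i] >= best:
            break                 # opening box i is not worth it
        total_cost += costs[i]
        opened.add(i)
        best = min(best, true[i])
        # condition on the observed value; this is where correlations
        # across boxes are exploited
        posterior = [(p, v) for p, v in posterior if v[i] == true[i]]
    return best + total_cost

print(sum(greedy_adaptive_policy() for _ in range(10_000)) / 10_000)
```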
Hyperspectral Imaging (HSI) provides detailed spectral information and has been utilised in many real-world applications. This work introduces an HSI dataset of building facades in a light industry environment with the aim of classifying different building materials in a scene. The dataset is called the Light Industrial Building HSI (LIB-HSI) dataset. This dataset consists of nine categories and 44 classes. In this study, we investigated deep learning based semantic segmentation algorithms on RGB and hyperspectral images to classify various building materials, such as timber, brick and concrete.
Strategic test allocation plays a major role in the control of both emerging and existing pandemics (e.g., COVID-19, HIV). Widespread testing supports effective epidemic control by (1) reducing transmission via identifying cases, and (2) tracking outbreak dynamics to inform targeted interventions. However, infectious disease surveillance presents unique statistical challenges. For instance, the true outcome of interest, one's positive infectious status, is often a latent variable. In addition, the presence of both network and temporal dependence reduces the data to a single observation. As testing entire populations regularly is neither efficient nor feasible, standard approaches recommend simple rule-based testing strategies (e.g., symptom-based testing, contact tracing) that do not take individual risk into account. In this work, we study an adaptive sequential design involving $n$ individuals over a period of $\tau$ time-steps, which allows for unspecified dependence among individuals and across time. Our causal target parameter is the mean latent outcome we would have obtained after one time-step if, starting at time $t$ given the observed past, we had carried out a stochastic intervention that maximizes the outcome under a resource constraint. We propose an Online Super Learner for adaptive sequential surveillance that learns the optimal choice of test strategies over time while adapting to the current state of the outbreak. Relying on a series of working models, the proposed method learns across samples, through time, or both, depending on the underlying (unknown) structure in the data. We present an identification result for the latent outcome in terms of the observed data, and demonstrate the superior performance of the proposed strategy in a simulation modeling a residential university environment during the COVID-19 pandemic.
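A minimal sketch of the online ensemble idea behind a Super Learner adapting through time, on synthetic data with a drifting outbreak; the candidate models, loss, and exponential-weights update are illustrative stand-ins, not the estimator from the paper:

```python
import numpy as np

# Online ensemble over candidate "working models": each new batch scores
# the candidates, and exponential weights shift mass toward whichever
# model tracks the current state of the (synthetic) outbreak best.

rng = np.random.default_rng(0)

def model_mean(history):   # candidate 1: running mean of past outcomes
    return float(np.mean(history)) if history else 0.5

def model_last(history):   # candidate 2: most recent outcome
    return history[-1] if history else 0.5

candidates = [model_mean, model_last]
weights = np.ones(len(candidates))
eta = 2.0                  # learning rate for exponential weights
history = []

for t in range(200):
    p_t = 0.2 + 0.15 * np.sin(t / 20)       # drifting infection risk
    preds = np.array([m(history) for m in candidates])
    w = weights / weights.sum()
    ensemble = float(w @ preds)             # combined prediction
    y = float(rng.random() < p_t)           # observed test outcome
    losses = (preds - y) ** 2               # per-model squared error
    weights *= np.exp(-eta * losses)        # down-weight poor models
    history.append(y)

print("final weights:", weights / weights.sum())
```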
It is crucial to choose the appropriate scale in order to build an effective and informative representation of a complex system. Scientists carefully choose the scales for their experiments to extract the variables that describe the causal relations in the system, and have found that a coarse (macro) scale is sometimes more causal and informative than the many-parameter micro-level observations. The phenomenon in which causality emerges under coarse-graining is called Causal Emergence (CE). Based on information theory, a number of recent works have shown quantitatively that CE indeed occurs when coarse-graining a micro model to the macro scale. However, the existing works have not addressed why and when CE happens. We quantitatively analyze the redistribution of uncertainty under coarse-graining and suggest that this redistribution is the cause of causal emergence. We further analyze the thresholds that determine whether CE happens. From the regularity of the transition probability matrix (TPM) of discrete systems, we derive mathematical expressions for the model properties and compute the threshold values for different operations. The results provide critical and specific conditions for CE, as helpful guidance for choosing a proper coarse-graining operation, and offer a new way to better understand the nature of causality and causal emergence.
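To make the quantities concrete, here is a minimal sketch of measuring CE as the gain in effective information (EI) when a micro TPM is coarse-grained; the toy 4-state system and grouping are illustrative, and EI follows the standard information-theoretic definition as the mutual information induced by a uniform intervention over states:

```python
import numpy as np

# Effective information of a TPM: mean KL divergence between each row
# (the effect of intervening on that state) and the average effect
# distribution under a uniform intervention.

def effective_information(tpm):
    tpm = np.asarray(tpm, dtype=float)
    avg = tpm.mean(axis=0)
    with np.errstate(divide="ignore", invalid="ignore"):
        kl = np.where(tpm > 0, tpm * np.log2(tpm / avg), 0.0)
    return kl.sum(axis=1).mean()

def coarse_grain(tpm, groups):
    """Coarse-grain a micro TPM: macro entry (a, b) sums the transition
    mass into group b, averaged over the micro states in group a."""
    tpm = np.asarray(tpm, dtype=float)
    k = len(groups)
    macro = np.zeros((k, k))
    for a, ga in enumerate(groups):
        for b, gb in enumerate(groups):
            macro[a, b] = tpm[np.ix_(ga, gb)].sum(axis=1).mean()
    return macro

# Toy 4-state micro system: states 0-2 mix uniformly, state 3 absorbs.
micro = np.array([[1/3, 1/3, 1/3, 0.0],
                  [1/3, 1/3, 1/3, 0.0],
                  [1/3, 1/3, 1/3, 0.0],
                  [0.0, 0.0, 0.0, 1.0]])
macro = coarse_grain(micro, groups=[[0, 1, 2], [3]])

print("EI micro:", effective_information(micro))   # ~0.81 bits
print("EI macro:", effective_information(macro))   # 1.0 bit -> CE
```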
Privacy noise may negate the benefits of using adaptive optimizers in differentially private model training. Prior works typically address this issue by using auxiliary information (e.g., public data) to boost the effectiveness of adaptive optimization. In this work, we explore techniques to estimate and efficiently adapt to gradient geometry in private adaptive optimization without auxiliary data. Motivated by the observation that adaptive methods can tolerate stale preconditioners, we propose differentially private adaptive training with delayed preconditioners (DP^2), a simple method that constructs delayed but less noisy preconditioners to better realize the benefits of adaptivity. Theoretically, we provide convergence guarantees for our method for both convex and non-convex problems, and analyze trade-offs between delay and privacy noise reduction. Empirically, we explore DP^2 across several real-world datasets, demonstrating that it can improve convergence speed by as much as 4x relative to non-adaptive baselines and match the performance of state-of-the-art optimization methods that require auxiliary data.
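A minimal sketch of the delayed-preconditioner idea on a toy problem (a simplification for illustration, not the paper's exact DP^2 algorithm or its privacy accounting): gradients are clipped and noised every step as in DP-SGD, while the RMSProp-style preconditioner is refreshed only every few steps from averaged noisy gradients, trading staleness for reduced noise:

```python
import numpy as np

# Toy quadratic objective f(w) = ||w - 1||^2 / 2, optimized with
# DP-SGD-style noisy gradients and a delayed diagonal preconditioner.

rng = np.random.default_rng(0)
d, lr, clip, sigma, delay = 10, 0.1, 1.0, 0.5, 20
w = np.zeros(d)
precond = np.ones(d)          # stale diagonal preconditioner
buffer = []                   # noisy gradients since the last refresh

def grad(w):
    return w - 1.0

for t in range(200):
    g = grad(w)
    g = g / max(1.0, np.linalg.norm(g) / clip)      # clip to norm <= clip
    g = g + rng.normal(0.0, sigma * clip, size=d)   # add privacy noise
    buffer.append(g)
    w -= lr * g / np.sqrt(precond + 1e-8)           # preconditioned step
    if (t + 1) % delay == 0:                        # delayed refresh:
        g_bar = np.mean(buffer, axis=0)             # averaging denoises
        precond = 0.9 * precond + 0.1 * g_bar**2
        buffer = []

print("distance to optimum:", np.linalg.norm(w - 1.0))
```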
Artificial intelligence (AI) has enormous potential to improve Air Force pilot training by providing actionable feedback to pilot trainees on the quality of their maneuvers and enabling instructor-less flying familiarization for early-stage trainees in low-cost simulators. Historically, AI challenges consisting of data, problem descriptions, and example code have been critical to fueling AI breakthroughs. The Department of the Air Force-Massachusetts Institute of Technology AI Accelerator (DAF-MIT AI Accelerator) developed such an AI challenge using real-world Air Force flight simulator data. The Maneuver ID challenge assembled thousands of virtual reality simulator flight recordings collected by actual Air Force student pilots at Pilot Training Next (PTN). This dataset has been publicly released at Maneuver-ID.mit.edu and represents the first public release of its kind of USAF flight training data. Using this dataset, we have applied a variety of AI methods to separate "good" from "bad" simulator data and to categorize and characterize maneuvers. These data, algorithms, and software are released as baselines of model performance for others to build upon, to help enable the AI ecosystem for flight simulator training.
Real-time air pollution monitoring is a valuable tool for public health and environmental surveillance. In recent years, there has been a dramatic increase in air pollution forecasting and monitoring research using artificial neural networks (ANNs). Most of the prior work relied on modeling pollutant concentrations collected from ground-based monitors and meteorological data for long-term forecasting of outdoor ozone, oxides of nitrogen, and PM2.5. Given that traditional, highly sophisticated air quality monitors are expensive and are not universally available, these models cannot adequately serve those not living near pollutant monitoring sites. Furthermore, because prior models were built on physical measurement data collected from sensors, they may not be suitable for predicting public health effects experienced from pollution exposure. This study aims to develop and validate models to nowcast the observed pollution levels using Web search data, which is publicly available in near real-time from major search engines. We developed novel machine learning-based models using both traditional supervised classification methods and state-of-the-art deep learning methods to detect elevated air pollution levels at the US city level, by using generally available meteorological data and aggregate Web-based search volume data derived from Google Trends. We validated the performance of these methods by predicting three critical air pollutants (ozone (O3), nitrogen dioxide (NO2), and fine particulate matter (PM2.5)), across ten major U.S. metropolitan statistical areas (MSAs) in 2017 and 2018.
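A minimal sketch of the supervised nowcasting setup on synthetic stand-in data (the feature layout and model choice here are hypothetical, not the paper's exact pipeline): meteorological variables plus aggregate search volumes are used to classify days with elevated pollutant levels:

```python
import numpy as np
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split

# Synthetic stand-in for daily city-level data: a few meteorological
# variables and search volumes for pollution-related query terms.
rng = np.random.default_rng(0)
n = 2000
met = rng.normal(size=(n, 3))     # e.g., temperature, wind, humidity
trends = rng.normal(size=(n, 5))  # search volumes for 5 query terms
X = np.hstack([met, trends])
# Synthetic label ("elevated PM2.5") correlated with a few features.
y = ((0.8 * met[:, 0] - 0.6 * met[:, 1] + 0.5 * trends[:, 0]
      + rng.normal(scale=0.5, size=n)) > 0.5).astype(int)

X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)
clf = GradientBoostingClassifier().fit(X_tr, y_tr)
print("AUC:", roc_auc_score(y_te, clf.predict_proba(X_te)[:, 1]))
```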
Obtaining photorealistic reconstructions of objects from sparse views is inherently ambiguous and can only be achieved by learning suitable reconstruction priors. Earlier works on sparse rigid object reconstruction successfully learned such priors from large datasets such as CO3D. In this paper, we extend this approach to dynamic objects. We use cats and dogs as a representative example and introduce Common Pets in 3D (CoP3D), a collection of crowd-sourced videos showing around 4,200 distinct pets. CoP3D is one of the first large-scale datasets for benchmarking non-rigid 3D reconstruction "in the wild". We also propose Tracker-NeRF, a method for learning 4D reconstruction from our dataset. At test time, given a small number of video frames of an unseen object, Tracker-NeRF predicts the trajectories of its 3D points and generates new views, interpolating viewpoint and time. Results on CoP3D reveal significantly better non-rigid new-view synthesis performance than existing baselines.
Image super-resolution is a common task on mobile and IoT devices, where one often needs to upscale and enhance low-resolution images and video frames. While numerous solutions have been proposed for this problem in the past, they are usually not compatible with low-power mobile NPUs having many computational and memory constraints. In this Mobile AI challenge, we address this problem and ask the participants to design an efficient quantized image super-resolution solution that can demonstrate real-time performance on mobile NPUs. The participants were provided with the DIV2K dataset and trained INT8 models to perform high-quality 3X image upscaling. The runtime of all models was evaluated on the Synaptics VS680 Smart Home board with a dedicated edge NPU capable of accelerating quantized neural networks. All proposed solutions are fully compatible with the above NPU, demonstrating rates of up to 60 FPS when reconstructing Full HD resolution images. A detailed description of all models developed in the challenge is provided in this paper.
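For illustration, a minimal PyTorch sketch of an NPU-friendly 3X super-resolution network in the spirit of such challenge entries (a hypothetical architecture, not a submitted solution); a model like this is trained in FP32 and then converted to INT8, e.g. via post-training quantization, before deployment:

```python
import torch
import torch.nn as nn

# A tiny 3x super-resolution network: a few small convolutions followed
# by a pixel-shuffle (depth-to-space) upsampler, both of which map well
# to quantized edge NPUs.

class TinySR3x(nn.Module):
    def __init__(self, channels=16):
        super().__init__()
        self.body = nn.Sequential(
            nn.Conv2d(3, channels, 3, padding=1), nn.ReLU(),
            nn.Conv2d(channels, channels, 3, padding=1), nn.ReLU(),
            # 3 * 3**2 = 27 channels feed the 3x pixel shuffle
            nn.Conv2d(channels, 3 * 9, 3, padding=1),
        )
        self.upsample = nn.PixelShuffle(3)   # 3x spatial upscale

    def forward(self, x):
        return self.upsample(self.body(x))

lr = torch.rand(1, 3, 360, 640)              # low-resolution input
hr = TinySR3x()(lr)
print(hr.shape)                              # (1, 3, 1080, 1920): Full HD
```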
Existing spherical convolutional neural network (CNN) frameworks are either computationally scalable or rotationally equivariant, but not both. Continuous approaches capture rotational equivariance but are often prohibitively computationally expensive. Discrete approaches offer more favorable computational performance but at the cost of equivariance. We develop a hybrid discrete-continuous (DISCO) group convolution that is simultaneously equivariant and computationally scalable to high resolution. While our framework can be applied to any compact group, we specialize to the sphere. Our DISCO spherical convolutions not only exhibit $\text{SO}(3)$ rotational equivariance but also a form of asymptotic $\text{SO}(3)/\text{SO}(2)$ rotational equivariance, which is more desirable for many applications (where $\text{SO}(n)$ is the special orthogonal group representing rotations in $n$ dimensions). Through a sparse tensor implementation we achieve linear scaling in the number of pixels on the sphere for both computational cost and memory usage. For 4K spherical images, we realize a saving of $10^9$ in computational cost and $10^4$ in memory usage compared to the most efficient alternative equivariant spherical convolution. We apply the DISCO spherical CNN framework to a number of benchmark dense-prediction problems on the sphere, such as semantic segmentation and depth estimation, on all of which we achieve state-of-the-art performance.
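A minimal sketch of the sparse-tensor implementation idea (an illustrative stand-in, not the DISCO operator itself): once the discrete-continuous kernel has been tabulated on the sampling points, applying the convolution to a spherical signal is a sparse matrix-vector product, so cost and memory scale linearly in the number of pixels for a kernel of bounded support:

```python
import numpy as np
from scipy.sparse import csr_matrix

# Sparse "convolution" on a pixelized sphere: K holds tabulated kernel
# weights with ~10 nonzeros per output pixel (random placeholders here),
# so storage and the matvec below are both O(n_pix).

n_pix, nnz_per_row = 100_000, 10
rng = np.random.default_rng(0)
rows = np.repeat(np.arange(n_pix), nnz_per_row)
cols = rng.integers(0, n_pix, size=n_pix * nnz_per_row)  # kernel support
vals = rng.normal(size=n_pix * nnz_per_row)              # kernel weights
K = csr_matrix((vals, (rows, cols)), shape=(n_pix, n_pix))

f = rng.normal(size=n_pix)   # a one-channel signal on the sphere
g = K @ f                    # convolution as a sparse matvec: O(nnz)
print(K.nnz, "nonzeros,", K.nnz / n_pix, "per output pixel")
```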